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Abstract 

This paper designs hybrid analog-digital channel estimation and beamforming techniques for multiuser massive 
multiple input multiple output (MIMO) systems with limited number of radio frequency (RF) chains. For these 
systems, first we design novel minimum mean squared error (MMSE) hybrid analog-digital channel estimator 
by considering both perfect and imperfect channel covariance matrix knowledge cases. Then, we utilize the 
estimated channels to enable beamforming for data transmission. When the channel covariance matrices of all 
user equipments (UEs) are known perfectly, we show that there is a tradeoff between the training duration and 
throughput. Specifically, we exploit that the optimal training duration that maximizes the throughput depends on 
the covariance matrices of all UEs, number of RE chains and channel coherence time (Tc). We also show that 
the training time optimization problem can be formulated as a concave maximization problem for some system 
parameter settings where its global optimal solution is obtained efficiently using existing tools. In particular, when 
the base station equipped with 64 antennas and 1 RE chain is serving one single antenna UE, Tc = 128 symbol 
periods (T^) and signal to noise ratio of lOdB, we have found that the optimal training durations are ATg and 20Tg 
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for highly correlated and uncorrelated Rayleigh fading channel coefficients, respectively. The analytical expressions 
are validated by performing extensive Monte Carlo simulations. 

Index Terms 

Massive MIMO, Millimeter wave, RF chain. Hybrid channel estimation. Hybrid beamforming. 

I. Introduction 

Massive multiple input multiple output (MIMO) teehnology is one of the promising means for aehieving 
the extremely high energy and spectrum efficiency requirements of the future 5G networks [1]. To exploit 
the full potential of a massive MIMO system, one ean leverage the conventional digital arehitecture where 
all the signal processing is performed at the baseband frequency. In a digital architecture, a complete radio 
frequeney (RE) ehain is required for each antenna element at the transmitter and reeeiver, including low- 
noise amplifier, down-eonverter, digital to analog eonverter (DAC), analog to digital eonverter (ADC) 
and so on [2], [3]. Thus, when the number of base station (BS) antennas is very large (i.e., massive 
MIMO regime)\ the high eost and power eonsumptions of mixed signal eomponents, like high-resolution 
ADCs and DACs, makes it diffieult to dedicate a separate RE ehain for eaeh antenna [5]-[10]. Eor these 
reasons, deploying massive MIMO systems with limited number of RE ehains and hybrid analog-digital 
arehiteeture has reeently reeeived signifieant attention. In this arehitecture, the digital signal proeessing 
ean be realized at the baseband frequency using microprocessors whereas, the analog signal proeessing 
ean be enabled at the RE frequeney by employing low eost phase shifters [11]-[13]. 

Hybrid arehitecture can be deployed for single user and multiuser massive MIMO systems. The key 
difference between the analog and digital components of the hybrid arehitecture is that, as the analog part 
is realized using phase shifters (i.e., to reduce eost), eaeh having a eonstant modulus, it is more eonstrained 
than that of the digital arehiteeture. Eurthermore, in the multiuser system, the digital eomponent ean be 

*To the best of our knowledge, there is no strict definition on how large the number of antennas should be to be called as massive MIMO. 
However, in a multiuser setup, the term ’’massive MIMO” is used to reflect that the number of BS antennas is much larger than the number 


of user equipments (UEs) [4]. 
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designed for eaeh UE independently (i.e., it ean be unique to eaeh UE) whereas, the analog eomponent 
must be the same for all UEs (i.e., the analog part eannot be altered adaptively for each UE). In [14]-[16], a 
hybrid architecture is suggested for single user massive MIMO systems where matehing pursuit algorithm 
is utilized for hybrid bemforming [2]. In [17], a low eomplexity eodebook based RE beamforming based 
on multi-level RE beamforming and level-adaptive antenna seleetion is eonsidered. In [18], a hybrid 
beamforming design utilizing interleaved and side-by-side sub-arrays is proposed [19]. This design is used 
for adaptive angle of arrival (AoA) estimation and beamformings by utilizing two algorithms; differential 
beam traeking and beam seareh. 

In [20], hybrid preeoding seheme for multiuser massive MIMO systems is eonsidered. The paper 
employs the zero foreing (ZE) approaeh where it is designed to maximize the sum rate of all users. 
In [6], a beam alignment teehnique using adaptive subspaee sampling and hierarehieal beam eodebooks 
is proposed for millimeter wave (mmWave) eellular networks. In [21], a beam domain referenee signal 
design for downlink channel with hybrid beamforming arehiteeture is proposed to maximize the gain in 
a eertain direetion around the main beam. Hybrid analog-digital beamforming is proposed in [22] for 
downlink multiuser massive MIMO systems where total sum rate maximization problem is eonsidered. In 
[23], hybrid beamforming design for multiuser setup with frequeney seleetive ehannels is eonsidered. This 
paper also diseusses the required number of RE ehains and phase shifters sueh that the digital and hybrid 
beamforming designs aehieve the same performanee. In [24], joint ehannel estimation and beamforming 
design is eonsidered for single user mmWave MIMO systems. This paper employs a eodebook approaeh 
to design its trainings for uniform linear array (UEA) ehannel models. 

Channel state information (CSI) aequisition is an important aspeet of a (massive) MIMO system. 
In general, a wireless ehannel has a nonzero eoherenee time where the ehannel is treated as almost 
eonstant. In a typieal setup, increasing the training duration improves the channel estimation quality at 
the expense of redueed overall system throughput when the ehannel eoherenee time is fixed. Henee, there 
is a tradeoff between the training duration and system throughput. In most praetieal oases, the CSI needs 
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to be learned to maximize the system throughput. This motivates us to study the training-throughput 
tradeoff for massive MIMO systems with limited RE ehains while leveraging the hybrid arehiteeture. 
The eurrent paper partieularly aims at determining the optimal training duration sueh that the system 
throughput is maximized. As ean be understood from the above diseussions, there are a number of 
researeh works utilizing hybrid arehiteetures. However, none of the aforementioned works eonsider the 
problems studied in the eurrent paper. Eor example, the works of [14]-[16], [20], [22], [23] foeus on the 
design of hybrid beamforming under the assumption of perfeet CSI. Eurthermore, the works of [18], [24] 
eonsider both ehannel estimation and beamforming with hybrid arehiteeture without taking into aeeount 
the ehannel eoherenee time (i.e., the ehannel estimation duration derived by the aforementioned works 
may not neeessarily maximize the system throughput). 

Given these diseussions, however, the study of training-throughput tradeoff has been eondueted in the 
eonventional MIMO systems where the number of RE ehains is the same as that of antennas. In [25], 
the training seheme for single user bloek fading ehannel under low signal to noise ratio (SNR) regions is 
eonsidered [26]. Eor the Rayleigh fading ehannel, the paper addresses on how long the ehannel eoherenee 
time be sueh that the lower bound mutual information behaves eloser to the eapaeity obtained when perfeet 
CSI is available at the reeeiver. In [27], the ergodie eapaeity of ultra wideband eommunieations having 
sparse multipath ehannels for single user system at low SNR regions is examined. In [28], the strueture 
of the optimal eapaeity aehieving input matrix is derived. This paper also shows that there is no eapaeity 
gain by deploying the number of transmit antennas to be more than the eoherenee interval (in symbols) 
of the ehannel. In [29], the noneoherent MIMO eapaeity in the high SNR regime is examined where it is 
shown that the number of transmit antennas required need not be more than half of the eoherenee interval 
(in symbols) in this regime. 

In [30], the authors eonsider multiple antenna eommunieations over a wideband noneoherent Rayleigh 
bloek fading ehannel. This paper examines the eapaeity of a MIMO system with an average power 
eonstraint, and eonsiders its relationship with the ehannel eoherenee time, number of transmit and reeeive 
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antennas and SNR. The relation between wideband eapaeity and ehannel sparsity is studied in [31]. The 
work of [32] eomputes the optimal number of UEs that ean be served in a given ehannel eoherenee time. 
Erom these explanations, we ean understand that the works of [25]-[28], [30], [31] foeus on the eapaeity 
eharaeterization of MIMO systems for different SNR regions and ehannel eharaeteristies. On the other 
hand, [32] tries to determine the optimal number of UEs for the given ehannel eoherenee time. However, 
in the eurrent paper, we have examined the optimal training duration using hybrid arehiteeture for fixed 
number of UEs and ehannel eoherenee time. We would like to emphasize here that one may still think to 
direetly use the solution derived in [32] for the setup of the eurrent paper. However, as will be shown in 
Seetion VI-A and demonstrated in the simulation seetion, sueh approaeh leads to the sub-optimal solution, 
and the throughput gap is signifieant in some parameter settings. 

The training-throughput tradeoff ean be studied by taking into aeeount different settings and assump¬ 
tions. As massive MIMO systems have signifieant interest at millimeter wave frequeneies [6], [14], [21], 
[33], one may be interested in examining the training-throughput tradeoff for these frequeneies only. Eor 
instanee, by exploiting the eharaeteristies of mmWave ehannels whieh are often speeular and of very low 
rank. However, massive MIMO systems still have praetieal interest at mierowave frequeney bands [1]. Eor 
this reason, we examine the training-throughput tradeoff for multiuser massive MIMO systems with limited 
RE ehains where the ehannel of eaeh UE has arbitrary rank. It is assumed that the number of UEs are fixed, 
the BS is equipped with a massive antenna array and eaeh UE is equipped with single antenna, with flat 
fading ehannels and TDD based ehannel estimation. Under these assumptions, first, we design novel TDD 
minimum mean squared error (MMSE) based hybrid analog-digital ehannel estimator by eonsidering both 
perfeet and imperfeet ehannel eovarianee matrix eases. Then, we utilize the estimated ehannels to exploit 
beamforming during data transmission. Einally, we study the training-throughput tradeoff for the seenario 
where the ehannel eovarianee matriees of all UEs are known perfeetly. The extension of the proposed 
designs for the ease where eaeh UE is equipped with multiple antennas, and experienees a frequeney 
seleetive ehannel has also been diseussed briefly. The main eontributions of the paper are summarized as 



Main manuscript: Accepted in IEEE Transaetions on Communieation 


follows: 

• Eirst, we propose ehannel estimation for multiuser massive MIMO system with perfeet and imperfeet 
ehannel eovarianee matriees by assuming that the number of RE chains is the same as that of 
antennas. When the eovarianee matrix is not known perfeetly, the global optimal solution of our 
ehannel estimator is obtained by utilizing eonvex optimization approaeh. Seeond, we exploit these 
estimators to design the hybrid analog-digital ehannel estimator and beamformings for the seenario 
where the number of RE ehains is less than that of antennas. As will be elear later in Seetion II, 
unlike the approaeh of [24], the proposed hybrid ehannel estimator and digital beamforming design 
ean be applied for any ehannel model. 

• When the eovarianee matriees of all UEs are known perfeetly, we study the training-throughput 
tradeoff for the multiuser massive MIMO systems having limited number of RE ehains. We exploit the 
faet that the optimal training duration that maximizes the overall network throughput depends on the 
operating SNR, available number of RE chains, channel eoherenee time (Tc) and eovarianee matriees 
of all UEs. Speeifieally, we show that the training time optimization problem ean be formulated as a 
eoneave maximization problem for some system parameter settings where its global optimal solution 
ean be obtained effieiently using existing tools. 

• We have validated the analytieal expressions by performing numerieal and extensive Monte Carlo 
simulations. In partieular, when the base station equipped with 64 antennas and 1 RE ehain is serving 
one single antenna UE, and SNR=10dB, Tc = 128 symbol periods (T^), we have found that the optimal 
training durations are ATg and 2 OT 5 for highly eorrelated and uneorrelated Rayleigh fading ehannel 
eoeffieients, respeetively. 

This paper is organized as follows: Seetion II introduees the system model and problem statement. 
Seetion III - V diseusses the proposed ehannel estimation and transmission approaehes when the number 
of RE ehains is the same as and less than that of antennas. Numerieal and eomputer simulation results 
are presented in Seetions VI-A and VI-B. Einally, eonelusions are drawn in Seetion VIII. 
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Fig. 1. The considered channel estimation and transmission frame stracture. 


Notations: In this paper, upper/lower-ease boldface letters denote matriees/eolumn veetors. tr(X), 

X^ and E(X) denote the (i, j)th element, trace, conjugate transpose and expected value of X, respeetively. 
We define ||X|||. as tr{X^X} and ||X ||2 as the square root of the maximum eigenvalue of X^X, I is 
an identity matrix with appropriate size, and represent spaees of M x M matriees with 

eomplex and real entries, respeetively. The next integer greater than or equal to x, optimal x and max(a;, 0) 
are denoted by \x~\, x* and [x]+, respeetively. The aeronyms blkdiag(.), s.t, rem and i.i.d denote ’’bloek 
diagonal”, ’’subject to”, ’’remainder” and ’’independent and identieally distributed”, respeetively. 

II. System model and problem statement 

This section discusses the system model and problem statement. We eonsider a multiuser system where 
a BS having N antennas and Nrf RE chains is serving K decentralized single antenna UEs. A TDD 
based ehannel estimation is adopted where the ehannels of all UEs are estimated at the BS in the uplink 
ehannel, and are utilized for both the uplink and downlink data transmission phases. Under sueh settings, 
one ean have a ehannel estimation and data transmission frame strueture as shown in Eig. 1. 

As we ean see from this figure, one block incorporates L frames for channel estimation and data 
transmission, where the duration of each frame is the same as the coherence time of the channel = f3Ts. 
In eaeh frame, aTg is used for channel estimation and the remaining {l3—a)Ts is used for data transmission. 
In a typieal massive MIMO system, the transmitted signal is reeeived by an antenna array eaeh deseribed 
by the number of seatterers, AoA, angle of departure (AoD), delay, and fast fading parameter. Unlike the 
fast fading, the angles and delays are almost stationary over a eonsiderable number of frames [34], [35]. 
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This motivates us to assume that the /cth UE has a eonstant channel covariance matrix R^., whieh does 
not depend on the fast fading eomponent, in the L frames of Eig. 1. Under this assumption, the ehannel 
between the kth UE and BS in eaeh frame ean be expressed as 

where the entries of G are modeled as i.i.d zero mean eireularly symmetrie eomplex Gaussian 
(ZMCSCG) random variables each with unit variance, and R^ G is a positive semi-definite matrix. 

Note that mmW ehannels are expeeted to be speeular, eonsequently R^ will tend to be low rank. 
However, at mierowave frequeney bands, the rank of R^ will be likely high. Nevertheless, as the eurrent 
paper does not assume any speeial strueture on the ehannel, the above statistieal ehannel model ean be 
utilized both for mmWave and mierowave frequeney band applieations. Eor mierowave frequeney regions, 
the assumption of perfeetly (imperfeetly) known R^ is eommonly adopted in the existing literature. 
The Doppler shift seales linearly with frequeney, thus, the eoherenee time of mmWave bands is an 
order of magnitude lower than that of eomparable mierowave bands. In a typieal setting, for example, 
one may experienee ehannel eoherenee times of 500/iS and 35/iS when the system is deployed at the 
earner frequencies 2GHz and 28GHz, respeetively [36], [37]. Eurthermore, the fading ehannel statisties 
beeomes wide-sense stationary (WSS) when the seattering geometry relative to a given user remains 
unehanged/varying slowly [38]. This validates that the eovarianee matriees of mmWave ehannels will also 
be almost eonstant for a eonsiderable symbol duration [38], [39]. On the other hand, different ehannel 
eovarianee matrix estimation methods are reeently proposed for millimeter wave ehannels in [40] by 
utilizing hybrid analog-digital arehiteeture. These motivate us to eonsider the following objeetives: 
Objectives of this paper: 

1) The first objeetive of this work is to design the ehannel estimation algorithm by assuming that R^ 
is known perfeetly and imperfeetly, and Nrf < N. 

2) The ultimate goal of the ehannel estimation step is to enable reliable data transmission on (/3 — a)Ts 
durations. Eor this reason, the seeond objeetive of this work aims on the design of a sueh that 
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the average throughput of eaeh frame is maximized for the given T^, /3 and Njip. This problem is 
mathematieally formulated as^ 

max Th (1) 

a 

where Th is the total throughput obtained in eaeh frame. To the best of our knowledge, when 
is known imperfeetly, the exaet throughput expression is not known, and its eomputation requires 
eonsidering signifieant bounds and expeetations [43]. Therefore, we examine (1) and study the 
training-throughput tradeoff when Npp < N and perfeet seenario only^. 

Note that in some praetieal setup, the total energy available for both training and data transmission 
tasks remain fixed on the basis of the overal SNR level. Eor sueh a ease, the training throughput tradeoff 
problem turns out to be on how to split the available energy into the training and data transmission 
phases. However, the training throughput tradeoff studied in the current paper eonsiders different deviees. 
Speeifically, the training signal is transmitted from the UEs only (i.e., no training data is transmitted from 
the BS). And, it is not elear how one ean assume fixed total energy that will be shared by the training 
and data eommunieation phases for our TDD based ehannel estimation, and data transmission in both the 
uplink and dowlink ehannels. Eor these reasons, we have tried to eompute the optimal training durations 
(in terms of symbol period) for maximizing the throughput while keeping the total eoherenee time and 
SNR levels eonstant (i.e., to solve (1)). Nevertheless, one ean still utilize the idea of fixed total energy 
by eonsidering both training and data transmission tasks in the uplink ehannel only where these tasks are 
performed by the UEs (or in a frequeney division duplex (EDD) system, in general, where training and 
data signals are transmitted from the same souree). And, sinee energy and SNR are interrelated quantitative 
terms, we believe that eonsidering the training-throughput problem in terms of SNR (energy) will lead to 

^Note that the current paper assumes that the channel covariance matrices of all UEs are available at the BS. Furthermore, the coherence 
time of the considered channel is known a priori. Although recent results at mm Wave frequency bands support these assumptions for some 
parameter settings (see for example, [36], [37], [41], [42]), further measurement results are required to validate the considered assumptions 

for all practically relevant deployment scenarios. 

^Studying the training-throughput tradeoff for imperfect Rfc is left for future research. 
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the same result. 


III. Channel Estimation when Nrf = N 

In this seetion, we provide the proposed ehannel estimation approaehes for the ease where Nrf = N. 
Eor some environments, the eovarianee matrix of eaeh UE is known a priori. Eor example, an i.i.d 
Rayleigh fading ehannel is eommonly adopted in an urban environment (i.e., = I); an exponential 

eorrelation model with eorrelation eoeffieient pk is also used to model the eovarianee matrix of eaeh UE 
[44]. Eurthermore, in a UEA ehannel model with seetorized antenna and predefined array geometry, the 
eovarianee matrix of eaeh UE ean also be known [45], [46]. However, the eovarianee matrix of eaeh UE 
may not be known for a general eommunieation environment. In sueh a ease, it may need to be estimated 
from the reeeived signal whieh introduees some error. In addition, as the eurrent paper assumes a hybrid 
arehiteeture with N^f ^ N RE ehains, it exploits less degrees of freedom (eompared to the system 
equipped with full number of RE ehains) to estimate the eovarianee matriees of all UEs reliably in a 
praetieally relevant bloek duration. Consequently, the effeet of ehannel eovarianee matrix estimation error 
eould be prominent. These motivate us to design ehannel estimators for the following two eases: 

1) Case I: In this ease, it is assumed that is perfeetly known a priori. 

2) Case II: In this ease, we assume that R^'^^ = R^'^^ + with ||Afc ||2 < e^, where R^'^^ is the 
estimate of R^'^^ and is the maximum error bound"^. 

A. Channel Estimation for Case I 

As explained in Seetion I, the ehannel estimation of a massive MIMO system is performed in the uplink 
ehannel. During the uplink training phase, the BS will reeeive the following signal at symbol period j. 

K 

ytj = 

i=l 

where K is the number of UEs, p*j is the pilot symbol of the Ah UE at symbol time j, e is the 
reeeived noise veetor at symbol time j and hj G is the ehannel between the Ah UE and BS. 

"^We would like to mention here that such bounded error parameter estimation approach is termed as worst case design [47]. 
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When Njip = N and the eoherenee time of the ehannel is Tc, ehannel estimation is performed just by 
employing r = K symbol periods (T^) and data transmission is performed on the remaining Tc — t time 
durations [32], [48]. By spending K training symbols, the BS reeeives the following signal 

Yt =HP^ + Ni 

where H = [hi, ha, • ■ ■ , h^^], Y* = [yti, yta, • ■ ■ , Yik], P = [Pi, P 2 , •'' , Vk], = [wi, n^a, • • ■ , ntx] is 
the additive noise whieh is modeled as N* CA/'(0, all) and p* = [pii,Pi 2 , • ■ ■ For simplieity, the 

entries of P are seleeted from a eonstellation satisfying P^P = I. With this training, the ehannel estima¬ 
tion phase follows two steps. The first step is used to deeouple the ehannels of eaeh UE independently 
and the seeond step is used to get the estimated ehannel of eaeh UE from the deeoupled terms. 

In the first step, the BS ean deeouple the ehannels of eaeh UE by multiplying the overall reeeived 
signal Y with P, i.e., 

E ^ YiP = H -f NiP ^ Ofc = h, + hfc, (3) 

where h^ = N^p^. As we ean see from (3), e^. does not eontain hj,i 7 ^ k. In the seeond step, the 
well known MMSE estimator is employed to estimate the kth ehannel from e^. This is performed by 
introdueing an MMSE matrix for the kt\\ UE and expressing the estimated ehannel as 

hfc = Wf Ofc, (4) 

where W^. is designed sueh that the MSE between and is minimized as follows 

ik = tr{E{|hfc - hfcp}} = tr{E{|Wf (hfc + hk) - h^p}} 

= tr{(Wf - I^)Rfc(Wf - Iiv)^ + W4. (5) 

The optimal is obtained from the gradient of ^k as 

^ = 0 ^ = (R, + (r?I)-‘R». (6) 

Note that when Npp = N and is known perfeetly, the above ehannel estimation approaeh is 
eommonly adopted in the existing literature and ean be eonsidered as an existing approaeh [32], [48]. 
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B. Channel Estimation for Case II 

In this subseetion, we provide the proposed ehannel estimation approaeh when is not known perfeetly 
(i.e., Case II). As we ean see from the above subseetion, the ehannel estimation phase has two main 
steps when is known perfeetly. The first step (i.e., user deeoupling) ean still be used even if R^ is 
not known perfeetly. However, the seeond step depends on the knowledge of R^. Thus, the remaining 
issue will be on how to design W^. sueh that of (5) is minimized under imperfeet R^. To this end, we 
propose robust eonvex optimization approaeh as follows. 

min ||(Wf - lAr)(Rf + Afc)||| + aMwfWfc}, s.t ||< 6^. (7) 

Wfe 

This problem can be equivalently expressed as [49] 

min tr{Dfc} + a^trjWfWfc} (8) 

S.t II Afcih < Cfc, (Wf - I^)(R;/' + A,)(R;/' + A,)^(W, - I^) A Dfc, A 0 

where A 0 denotes positive semi-definite. Using Sehur Complement, the last inequality beeomes [49] 

I (Rf + Afc)^(Wfc - I^) 

A 0. 

(W" - + AO D* 

This inequality ean also be expressed as 

Afc A Bf AfcCfc + (Bf A,C,)^ (9) 

where 

I teJ^Wk-lN) 

Ak= , Bfc = [0, (Wfc-I^)], Ck= [-1, 0]. (10) 

(Wf - I^)Rf D, 

Thus, the last eonstraint of ( 8 ) ean be replaeed by (9). To solve ( 8 ), we consider Lemma 1 below. 
Lemma 1: Eet A, B and C are given matriees with A = A^, the relation 

A A B^AC + (B^AC)^, A ; ||A ||2 < e 


( 11 ) 
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exists if and only if there exists A > 0 and 

A-AC^C -eB^ 

A 0. 

-eB AI 

Proof: See Proposition 2 of [50]. ■ 

Using Lemma 1, problem (8) ean be reformulated as 

Afc-AfcCfCfc Wf 

^ 0 , 

—CfcBfc Afcl Wfc I 

where A^, B^ and are as defined in (10). This problem is the well known semi-definite programming 
problem where its global optimal solution ean be found by applying interior point methods [49]. 

We would like to mention here that problem (7) has ’’similar” mathematieal strueture to that of the 
worst ease robust MIMO beamforming problem (see for example (39) of [47] and (25) of [51]). Thus, 
the approaeh used to solve (7) ean be eonsidered as the modified versions of the teehniques used in [47], 
[51]. Eurthermore, as will be detailed in the next subseetion, we have exploited the solution of (7) to 
handle the seenario where the number of RE ehains is smaller than those of antennas. 

C. Low Complexity Channel Estimator 

When Rfc is known perfeetly, the eomplexity of (6) is the same as that of matrix inversion whieh is 
0{N^). On the other hand, under imperfeet R^, (12) has + 1 real optimization variables, and two 
semi-definite eonstraints with 6N and 4A^ dimensions. The worst ease eomputational eomplexity of (12) in 
terms of number of iterations is upper bounded by 0{y/6N + 4:N) where the eomplexity of eaeh iteration 
is on the order of 0((6A^^ -f 1)^((6A^)^ -f (4A^)^)). Henee, the overall eomplexity of solving (12) is upper 
bounded by 0{s/lON{6N‘^ + ^ [52]. Although these eomplexities have polynomial 

order, they may be expensive espeeially when N is very large (i.e., in massive MIMO systems). 

In a typieal indoor massive MIMO (mmWave) eommunieation, R^ will often tend to be a very low 
rank matrix. Thus, one approaeh of redueing the eomplexity of the ehannel estimators is to exploit the 


AO (12) 
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rank of whieh is explained as follows. Erom the eigenvalue deeomposition of R^, we will have 
Rfc = QfcAfeQ^, where G with Lk as the rank of R^, and we ean reexpress as 

hfc = Qfchfc, Qf hfc = hfc (13) 

where G is modeled CA/'(0, A^). As we ean see, is the produet of and h^.. 

The main idea of redueing the eomplexity of the ehannel estimators is to first estimate and then to 
multiply the estimated value just by Qfc. Using (3) and (13), we ean have 

Ofc = Qfehfc + fifc, Qf Ofc = hfc + Qf fifc. 

By introdueing W^, one ean express the estimated as 

fifc = Wf Qf Ofc. (14) 

When Rfc are known perfeetly, W^. ean be designed using the MMSE method like in (6). When R^ 
is not known perfeetly and is multipled by Q^, the error eovarianee beeomes Erom the 

Cauehy-Sehwarz inequality, we will have 

||Qfc^Afc||2 < ||Qf^||2|| Afc||2 = ||Afc||2 < €k- (15) 

As the error bound is still maintained in sueh a ease, the optimal Wjf can be obtained similar to (12) 
when Rfc is not known perfeetly. 

Onee is designed, the estimated ehannel ean be eomputed as 

hfc Qfchfc. (16) 

Erom the above explanation, we ean understand that the eomplexity of the ehannel estimation (16) becomes 
0{L\) and 0{Ll-^) for perfeet and imperfeet R^, respeetively. And in praetiee as ^ N, this ehannel 
estimation approaeh ean reduee the eomplexity signifieantly. 

Note that the ehannel estimators of this seetion are designed by assuming a = K. However, in some 
eases, there eould be an interest to design the ehannel estimator for more general training duration sueh 
as a = 6K, where 0 is a positive integer. Eor sueh a ease, the same pilot symbols as those of this seetion 
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ean be reused for K conseeutive symbol periods, and utilize linear combination approach. By doing so, 
one can obtain similar structure as of (3) but the effeetive noise varianee is redueed by a faetor of 6. 
This shows that the estimation teehniques of this seetion ean be applied straightforwardly for such cases. 

Design perspective: As explained in the previous seetion, the ultimate goal of the ehannel estimator 
is to enable data transmission. In this regard, two design approaehes are eommonly adopted, one is by 
assuming perfeet CSI and the other is by eonsidering imperfeet CSI. Perfeet CSI ean almost be aehieved 
when Rfc is known perfeetly and aj 1 (i.e., at very high SNR). In all other eases, perfeet CSI ean never 
be aehieved. Thus, in general, the transmission phase should also take into aeeount the effeet of imperfeet 
CSI espeeially due to the imperfeet R^.. However, designing the transeeivers by assuming imperfeet CSI is 
not trivial and it requires a robust approaeh for eaeh frame duration whieh is almost infeasible partieularly 
for large antenna array systems (see for example [43], [47], [53], [54]). Eor this reason, the estimated 
ehannel obtained by utilizing the of (6) and (12) or of (14) is treated as perfect channel for data 
transmission phase^. As we ean see from Eig. 1, the MMSE ehannel estimator Wfc(Wfc) is designed only 
onee in eaeh bloek. This validates that the eonsidered MMSE ehannel estimators have praetieal interest. 

Onee the ehannel estimation phase is aeeomplished, the next phase will be to perform data transmission. 
As diseussed above, data transmission may take plaee in both the uplink and downlink ehannels. During 
data transmission phases, the BS exploits antenna arrays to design the appropriate beamforming matriees. 
In the eurrent paper, we employ the well known ZE approaeh in both the downlink and uplink transmission 
phases. This is beeause ZE approach is optimal in a rich scattering environment and easy to implement 
for massive MIMO systems [4], [23]. When R^. is known perfeetly, = N and given K, the optimal 
solution of (1) is a = Tf [32]. 

In the following, we diseuss the ehannel estimation and data transmission phases when Npp < N. 
As will be elear in the sequel, the ehannel estimator of this seetion is exploited to design the ehannel 
estimator and study the training-throughput tradeoff when Nrf < N. Eor better flow of the text, we first 

^We are particularly interested for high SNR regions 
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start with single user case and then we proeeed to multiuser. 

IV. Channel Estimation and Training-Throughput Tradeoff for Single User when 

Nrf < N 

This seetion diseusses the ehannel estimation and studies the tradeoff between the training duration and 
throughput when Nrf < N for the single user ease. 


A. Channel Estimation 

Eor the single user setup, the BS will reeeive the following signal in all antennas during the training 
duration (i.e., (2) with K = 1). Throughout this seetion, we have removed the subseript 1 for readability. 

y = hp* + n 

where p is chosen as |pp = 1. As mentioned in the introduction section, the hybrid analog-digital channel 
estimator eonsists of the analog and digital part, where the analog part is realized at the RE frequeneies. 
Assume that we employ a trainings to perform ehannel estimation and Uj G is used as the 

analog ehannel estimator in the Ah pilot symbol. With this matrix, the reeeived samples after ADC in the 
Ah pilot symbol ean be expressed as 

y. = Uf(h/ + n,). (17) 

By multiplying this signal with p, we will get 

hi =pyi = Uf(h-f Hip). (18) 


Now if we employ a pilot symbols for training, we will have 


h = U^h -f n 


(19) 


where h = [hf, h^, • • ■ , h^]^, U = [Ui, U 2 , • ■ ■ , U^], n* = Uf n^p and n = [nf, n^, • • ■ , n^]'^. Einally, 
we utilize the digital ehannel estimator W to estimate h from h as follows 


h = W^ir. 


(20) 
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By doing so, we will get the following MSE ^ 

^ =tr{E{|h - h|2}} = tr{(W^U^ - Ijv)R(W^U^ - I^v)^ + W^R,-,W} (21) 

where R is the eovarianee matrix of h, Rjj = E{nn^}. Thus, when R is known perfeetly, our objeetive 
will now be to design W and tj to solve 

min||(W^U^-I^)yR||| + W^RsW. (22) 

w,u 

And when R is not known perfeetly, we will have the following problem 

min||(W^U^-Ijv)(R^/^ +A)||| + W^R^W, s.t ||A ||2 < e. (23) 

w,u 

In general, the problems (22) and (23) are non-eonvex. However, for fixed U(W) these problems are 
eonvex. Therefore, one approaeh of solving sueh problems is to iteratively optimize one variable while 
keeping the other eonstant. In a massive MIMO system, as N is typieally large, solving these two problems 
iteratively is eomputationally expensive. Eor this reason, we preseleet tj from the solution of Seetion III 
and then we optimize W for fixed tj. The question now is how do we seleet tj? 

In the following, we provide the proposed method to seleet tj when R is known perfeetly^. By 
eomputing the singular value deeomposition (SVD) of W* in (6), one ean get 

W* = UDV^ (24) 

where D is a diagonal matrix eontaining the non-zero singular values of W* arranged in deereasing order. 

Aeeording to the detailed results diseussed in [23], when eaeh of the elements of tj has a modulus 
less than or equal to 2, this matrix ean be implemented using phase shifters in two approaehes. The first 
approaeh assumes the availability of suffleiently high resolution (or theoretieally infinite) phase shifters 
(i.e., as in Seetion IV.A of [23]) whereas, the seeond approaeh assumes to utilize switches and eonstant 
phase shifters (i.e., the phases of eaeh phase shifter is fixed a priori as in Seetion IV.B of [23]). Given the 
limitations of the eurrent eleetronie teehnology, it is almost infeasible to have low eost and suffleiently 

*The same approach can also be applied for imperfect R. 
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high resolution phase shifters. Eor this reason, we believe that the seeond implementation approaeh is 
realistie (see also the diseussions in Seetion VI-B). 

As eaeh of the elements of U has a maximum modulus of 1, tj ean be ehosen as the first aNjip 
eolumns of U. Erom these diseussions, one ean understand that the solution obtained in Seetion III is 
useful for designing the hybrid analog-digital ehannel estimator with Npp < N. 

Optimality: If rank(W*) < Npp and a = 1, then the optimal solution of (22) beeomes U = U and 
W = DV^. This is due to the fact that the objeetive funetion of (22) is lower bounded by the minimum 
value of (5). However, as rank(W*) < Npp may not hold always, ehoosing tJ as the first aNpp eolumns 
of U is not optimal for an arbitrary parameter settings. 

Eor the given tj, (22) and (23) ean be solved exaetly as in Seetion III. After solving (22), we will have 
the following normalized MSE 

f = (25) 

Note that one ean preseleet tj different from the eurrent paper. However, as the foeus of the paper is 
to study the training-throughput tradeoff of the proposed hybrid analog-digital arehiteeture, we utilize this 
tj. And as will be detailed in the next subseetion, the approaeh of examining the tradeoff between the 
training duration and its eorresponding throughput ean still be extended for other ehoiees of tj. 

B. Training-Throughput Tradeoff 

In this seetion, we study the training-thoughput tradeoff for single user ease when Npp < N and R is 
known perfeetly. Data transmission ean take plaee either in the downlink or uplink ehannel. As the BS 
has antenna arrays, it applies beamforming both in the downlink and uplink ehannels. By doing so, we 
will have the following estimated signals 

X =h^ba; -f Downlink channel, x = a.^{hx -f n^;) Uplink channel (26) 

where x is the transmitted signal whieh is assumed to have E|a;p = 1, 5; is the estimate of x, b (a^) is 
the preeoding (deeoding) veetor, and rid and are additive noises in the downlink and uplink ehannels. 
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respeetively with ~ CA/’(0, crj) and ~ CA/’(0, crjl). 

Note that the eurrent paper examines both the training and data transmission phases. In some eases, it 
may be required to utilize different SNRs in these two phases. Speeifieally, the ehannel estimation phase 
may need to have high SNR to obtain better performanee in the beamforming phase. One approaeh of 
eontrolling the SNR is by assigning different transmission powers in these two phases (i.e., the noise 
varianee is fixed), and the other approaeh is to set the same transmission powers in both training and 
data transmission phases, and assume different noise varianees. In the eurrent paper, we have followed the 
latter approaeh just for mathematieal eonvenienee and better flow of the text. Nevertheless, in praetiee, the 
noise varianee would be the same both in the ehannel estimation and beamforming phases. The desired 
SNR levels will, therefore, be maintained just by eontrolling the transmission power. 

When K = 1, the downlink and uplink ZE approaehes turn out to be maximum ratio transmission 
(MRT) and maximum ratio eombining (MRC) approaehes, respeetively whieh ean be designed as [23] 

h=Kh DL-MRT, a = UL - MRC (27) 


where n = (E{h^h})“^/^ is ehosen to ensure that the average transmitted power in the uplink and 
downlink ehannels are the same (i.e., unity), and DE-MRT and UE-MRC denotes the downlink MRT and 
uplink MRC approaehes, respeetively. 

Aeeording to [23], the transmitter and reeeiver veetors b and a ean be implemented with a hybrid 
analog-digital arehiteeture without performanee loss when Nrf > K. Eor this reason, N^f = K = 1 h 
suffieient for the transmission phase. Under this setting, the downlink and uplink average SNRs beeome 

tP + t 


E{0'} DL - MRT. E{o^} = 


^nd\‘ 




E|a^nrf| 




UL - MRC 


(28) 


where 


E{h^h} = f, E{|h^Wnp} = f, E{h^WU^h} = t 


(29) 


with t = tr{RM}, i = a^trjW^RW}, t = tr{MRM^} + cr^trlW^W} and M = W^U^. In 
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partieular, when ^ 1 and h is an i.i.d Rayleigh fading ehannel with R = I, we will have 


E{7} 




,tr{U«R^U(U»RU)-} _ a _ 




CTa 


(30) 


With the above average SNRs, we get the following average normalized throughput^ 


Th =E^ log2(l + ^) I < (ZZ_^ log,(l + E{7}) 


Tr 


Tr 


(31) 


where the seeond inequality employs Jensen’s inequality as log(l + x) is a eoneave funetion [55]^. 

As we ean see from (28), Ejy} depends on t, i and t whieh are funetions of tj. This validates that Ejy} 
depends on the training duration a. In partieular, inereasing a inereases Ejy} (easily seen from (30)) but 
it deereases (Tc — aTs). This shows that the average throughput of the eonsidered hybrid arehiteeture is 
not neeessarily the same for all training periods. Henee, a must be optimized to solve (1) as 

max^^^—Z^log2(l+ E{7}). (32) 


To get better insight about the optimal a, we eonsider the following lemma. 

Lemma 2: When Nrp = 1, MRT (MRC) beamforming is employed and eaeh of the elements of h is an 
i.i.d Rayleigh fading, (32) is strietly eoneave optimization problem where the optimal a is unique and ean 
be obtained by simple biseetion seareh method. Eurthermore, for an arbitrary h with known eovarianee 
matrix R with ^ ^ ol and Ejy} 3> 1 (i.e., at very high SNR), (32) is also a eoneave 

optimization problem where the optimal a ean be found from 


{13 - a) log 2 [/(a + 1)] - (/3 - a + 1) log 2 [/(a)] = log 2 ( 7 max) (33) 


where gi are the eigenvalues of R sorted in deereasing order (i.e., gi > g 2 >,■ ■ ■ ■,> fl'Af),7max = 

Roi) = YR=i and J with 0 < 5* < 1. 

Proof: See Appendix A ■ 

It is evident that 'ymax inereases as the spatial eorrelation of the UE’s ehannel inereases. Erom (33), 
we ean thus notiee that the optimal a deereases as the ehannel eorrelation inereases. In other words, 

’since £{ 7 } is the same in both the downlink and uplink channels, the average normalized throughput is the same in both of the channels. 
*As will be demonstrated in the simulation section, this bound is tight. 
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maximum training duration is required when the UE’s channel eoeffieients are independent. This faet has 
also been validated in the numerieal and simulation seetions. 

As ean be seen from Eig. 1, both ehannel estimation and data transmission operations are exeeuted in 
eaeh frame. In the — r duration, data transmission ean be performed by three approaehes. In the first 
approaeh, eaeh of the Tc—r duration is partitioned for both uplink and downlink ehannel data transmissions. 
In the seeond approaeh, either of the uplink or downlink data transmission takes plaee alternatively in 
each frame. The third approaeh eould be a eombination of the first and seeond approaehes. In all these 
approaehes, sinee Ejy} is the same in the downlink and uplink ehannels, the optimal a of (32) is the 
same for all of these three transmission approaehes. 

V. Channel Estimation and Training-Throughput Tradeoff for Multiuser when 

^RF < ^ 

The key differenee between the single user and multiuser eases is that the former does not experi- 
enee interferenee while the latter does. This interferenee arises both during ehannel estimation and data 
transmission phases. In the following, first we provide our ehannel estimation approaeh for multiuser 
system employing hybrid analog-digital arehiteeture, and then we utilize the estimated ehannel to study 
the tradeoff between training duration and throughput. 

A. Channel Estimation 

Eike in Seetion IV, the ehannel estimation phase ean be performed both for perfeet and imperfeet 
oases. To remove the effeet of interferenee between the ehannel veetors of K UEs, we assume that a 
is a multiple of K. And under this setting, the ehannel of eaeh UE ean be deooupled by allowing K 
orthogonal pilots for fixed analog ehannel estimator part. In this regard, we suggest to fix tjj (i.e., the 
analog ehannel estimator matrix) oonstant for the Ah set of K trainings. By doing so, one ean get 


hfci = Uf (hfc -f fifc) 


(34) 
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where is as defined in (3). Now if we apply a = OK trainings and introduee the digital channel 
estimator of the kth. UE W^, we can express the estimate of as 

hfc = Wf hfc (35) 

where ‘U = [Ui, U 2 , ■ ■ ■ , U 51 ]. The design of tj and ean be examined 

like in (22) and (23) for perfeet and imperfeet R^. eases, respeetively. Eurthermore, it ean be shown that 
the joint optimization of tj and will result to a non-eonvex problem. Eor this reason, we preseleet 
tj and optimize W^. for fixed tj like in the above seetion. To this end, we ehoose tj as the first ONjip 
columns of U, where 

w* =[W*, W*, ■ ■ ■ , W^] = UDV^, (36) 

D is a diagonal matrix eontaining the non-zero singular values of W* arranged in deereasing order, and 
are the solutions obtained from ( 6 ) for perfeet R^. ease (by solving ( 12 ) for imperfeet R^. ease). 
Onee the ehannel estimation is performed, we will have the following normalized total MSE 

k=l 

B. Training-Throughput Tradeoff 

This subseetion diseusses the training-throughput tradeoff for multiuser seenario with perfeet R^. Eor 
better exposition of the proeeeding diseussions, here we also assume that a is a multiple of K. Under this 
assumption and after doing some mathematieal manipulations, we ean rewrite the eombined estimated 
ehannel of all UEs H = [hi, h 2 , • ■ ■ , hi^] obtained from (35) as 

H = MHd + WNd (38) 

where M = [Mi, M 2 , • ■ ■ , M^^] with M^ = Wf tj^, W = [Wi, W 2 , • ■ ■ , W^^], = blkdiag(hi, ha, • • ■ 

and Nrf = blkdiag(hi, ha, • • ■ ,hi^). If we again assume that the ehannels of all UEs are uneorrelated 
whieh is usually the ease in praetiee, we will have 


E{H^H} = diag(ti,t2, • • ■ E{(MH,)^H} = E{H^MH,} = diag(ti,f2, • • ■ ,tir) (39) 
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where h = tr{RfcMfMJ + WJ and tk = tr{RfcWfU^}. 

The data transmission phase can be either in the uplink or dowlink ehannel. To this end, we employ 
the well known ZE approach in both the downlink and uplink transmission phases, respeetively as 

B =kH(H^H)~^ DL - ZF, A = UL - ZF (40) 


where k = is introdueed to maintain the BS average transmitted power to unity, 

and DE-ZF (UE-ZE) denotes the downlink (uplink) zero foreing. 

With these beamfoming matriees and using (39), the downlink and uplink average SNR of the kth UE 
are given as 


K, 


E{7fc} 


(7 


d L 


H^H(H^H)-2h^H 


\tk 


k,k 


DL - ZF and UL - ZF. 


(41) 


And the average normalized throughput beeomes 

K 


Th =E 


^ ^ i=\ ^ ^ i=l 


Like in the single user ease, Ejyfc} depends on tk and tk whieh are funetions of U. This validates that 
E{ 7 fc} depends on the training duration a whieh may need to be optimized for maximizing Th as 

{7-aT,) ^ 


max 


T. 


5^1og2(l + E{7*})- 


(43) 


2 = 1 


For better exposition on the optimal a for multiuser systems, we consider the following lemma. 

Lemma 3: When a = 9K and the ehannel of eaeh UE is i.i.d zero mean Rayleigh fading (i.e., R^ = g^I), 
(43) is a strietly eoneave optimization problem where the optimal 9 is unique and ean be obtained by 
simple biseetion seareh method. Speoifieally, when Ejyfc} 3> 1 (i.e., at high SNR), the optimal 9 satisfies 


^-K\og,{9) = K 


1 + log2 A 




K 


1 + log2(0) +log2 -E 


O'; 


9 


where q = - 


\K qi+cr^ 




-1 


Proof: See Appendix B 


(44) 
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Erom (44) one ean notiee that the optimal 6 deereases when q inereases (i.e., the optimal training 
duration inereases as the SNR deereases). This faet has also been demonstrated both in the numerieal and 
simulation results by eonsidering practically relevant SNR regions. 

Eor the fixed q, a'j and 9 (i.e., independent of K), one ean notiee from (44) that j3 may need to inerease 
as K inereases to maintain the above equality whieh is required at optimality. Erom this diseussion, we 
ean understand that the maximum possible value of {3 for the mmWave and mierowave frequeney bands 
may vary from one seenario to another (i.e., eannot be determined a priori). However, the minimum value 
of 13 should be the same as that of the optimal training duration obtained from the fully digital arehiteeture 
ease. This is beeause the digital arehiteeture has more degrees of freedom (whieh eonsequently help utilize 
the minimum training duration) than that of the hybrid arehiteeture. 

In the multiuser setup, providing analytieal proof showing that (43) is eoneave for general settings is 
not trivial as the £{ 7 ^} of (28) depends on K, R^, af and crj. Nevertheless, sinee eomputing E{ 7 fc} 
numerieally is quite simple for the given R^, it is still possible to get the optimal 9 by examining Th 
numerieally. And as will be demonstrated in the numerieal and simulation seetions, problem (43) is still 
eoneave for praotieally relevant parameter settings. 

Note that the analog ehannel estimation matrix tj* is realized using RE eleetronie eomponents whieh 
usually introduee some delays when the eorresponding matrix is switched from Uj to Uj+i. Eor instanee, 
in the single user setup, Uj may need to be updated every Tg. However, one ean still treat y* of (17) as 
the reeeived signal obtained at eaeh measurement period whieh eonsists of Tg and possible delay 
(i.e., Tm = Tg + Td). And, from the diseussions of Seetions IV and V, one ean learn that the optimal 
training duration does not depend on the speeifie values of Tm{Tg). Eor this reason, the introduetion of 
delays in the analog eomponents will not affeet the analysis of the eurrent paper. We would also like to 
point out that adaptive eompressive sensing method has been utilized in [24] where the authors employ 
the number of measurements (observations) to faeilitate the ehannel estimation. This paper also follows 
the assumption that the analog beamforming matrix be varied for eaeh measurement (see (9) of [24]). 



Main manuscript: Accepted in IEEE Transactions on Communication 


Extension for the case where a ^ 9K: The channel estimation and training-throughput tradeoff 
discussions assume that a is a multiple of K. However, it may also be interesting to consider the case 
where a ^ 9K. In such a case, we propose to allow only the first K = rem(^) UEs transmit their 
orthogonal pilots during the channel estimation phase when {9 — 1)K < a < 9K,W9. And these K UEs 
can be selected based on their equivalent channel gains (i.e., trjUl^Rfctje}). By doing so, each UE’s 
channel vector will not experience interference from the other UE’s channel. To compute the optimal a, 
we suggest to use a two step approach, i.e., first we compute 9 using (43) (fast searching), then we tune 
a in the range {9 — 1)K < a < {9 + 1)K (tuning). By doing so, the complexity of searching the optimal 
a can be reduced significantly^. 

Complexity: The computational complexity of the considered training-throughput tradeoff arises from 
solving the channel estimator discussed in Section III-C and the complexity of Appendix A (B) (i.e., 
computation of the optimal training time). However, since the optimal solution computed from Appendix 
A (B) can be obtained using bisection search which has negligible computational load [49], the overall 
complexity of the training-throughput tradeoff study is the same as the one discussed in Section III-C. 

We would like to recall here that the training-throughput tradeoff is studied by utilizing the normalized 
throughputs Th given in (31) and (42). Since each of these expressions has an outer expectation term, 
optimizing Th as it is appears to be challenging. Eor this reason, we employ Jensen’s inequality to get 
the upper bound of the normalized throughput which has convenient mathematical structure for studying 
the training-throughput tradeoff. Eurthermore, we have demonstrated by computer simulation that the 
examined throughput is tight (see Section VI-A). However, this simulation result cannot act as a general 
proof to ensure the tightness of the upper bound throughput. Therefore, studying the training-throughput 
tradeoff by considering the exact (theoretically tight bound) throughput is left for future research. 

Up to now, we assume that the covariance matrix of each UE is known either perfectly or imperfectly. 

*As will be clear in the numerical and simulation sections, the optimal a decreases as the rank of Rfc decreases. This fact again can he 
exploited to get the optimal a with reduced complexity. 
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In faet, when are known and available a priori (for example in the i.i.d Rayleigh fading ehannel 
environment), there is no need to estimate the eovarianee matrix. However, this may not happen for all 
seenarios sueh as an indoor environment. This validates the need to estimate the eovarianee matrix. In this 
regard, we have provided a brief summary of the sample ehannel eovarianee matrix estimator for eaeh 
UE when < iV in Appendix 

Extension to Multiuser MIMO Case: In the analysis of this paper, it is assumed that eaeh of the 
UEs is equipped with single antenna. However, the deployment of multiple antennas are adopted in 
different existing standards. In mierowave frequeney bands, as eaeh of the UEs is likely equipped with 
few number of antennas, the eonventional digital arehiteeture is still a reasonable solution. Eor sueh a 
ease, one ean apply the training-throughput tradeoff study of the eurrent paper just by treating the number 
of antennas as the total number of ’’virtual single antenna UEs”. However, at mmWave frequeneies, a 
hybrid arehiteeture can be deployed at eaeh UE. Eor sueh a ease, one ean study the training-throughput 
tradeoff by eonsidering two systems: The first system assumes that eaeh UE utilizes prefixed analog 
ehannel estimation and beamforming matriees. This analog matrix ean be seleeted by exploiting the 
history of ehannel eovarianee statisties (sueh an approaeh drastieally reduees the power eonsumption and 
eomputational load at eaeh UE). Eor this system, one ean notiee that the analysis of the eurrent paper ean 
be applied straightforwardly. The seeond system does not impose any a priori assumption on the analog 
arehiteeture of eaeh UE. In general, such an assumption leads to a superior performanee at the expense 
of inereased eomplexity. And studying the training-throughput tradeoff for this system is still an open 
problem and is left for future work. 

VI. Numerical and Monte Carlo Simulation Results 

This seetion provides numerieal and Monte Carlo simulation results of the normalized MSE and 
throughput expressions. In this regard, we set Nrf = K, af = a^, /3 = 2KTo, a = 9K and we vary 

''’We would like to point out here that as the analysis of the current paper depends on R* which is the same for all sub-carriers, the 
training-throughput tradeoff study is also valid for frequency selective channels. 
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To, N, K, 6, Rfc, where To is introdueed for eonvenienee. To this end, are taken from a widely used 
exponential eorrelation model as Hk = Pk > where 0 < pfc < 1. We have used exponential eorrelation 
model beeause of the following two reasonsEirst, the exponential model is physieally reasonable in a 
way that the eorrelation between two transmit antennas deereases as the distanee between them inereases 
[44]. Seeond, this model is a widely used antenna eorrelation model for an urban area eommunieations 
where traffie is usually eongested [56]. We utilize the SNR whieh is defined as ^ (i.e., the multiuser 
version of x in (26)) and normalized training duration (i.e., 9). 

A. Numerical Results 

This subseetion presents the numerieal results obtained from the normalized MSE and throughput 
expressions for the seenario where R^ are known perfeetly. 

1) Single User: The aehieved normalized MSE for = 64 and To = 64, and different 6, p = pi and 
SNR is plotted in Eig. 2. As ean be seen from this figure, inereasing 9 improves the quality of the ehannel 
estimator by redueing the MSE whieh is expeeted. On the other hand, for a given operating SNR, the 
required training duration to aehieve a given target MSE deereases as p inereases. This result is expeeted 
beeause as p inereases, ^ with Ai > A 2 , • ■ ■ Aat deereases with i whieh eonsequently help to reduee the 
training duration. Eig. 3 shows the aehievable throughput versus training duration for different SNR and 
P- 

As ean be seen from this figure, despite the eonventional ehannel estimation approach, the optimal 
training duration of the proposed hybrid analog-digital ehannel estimator is not neeessarily the same for 
all p. In partieular, the optimal training duration deereases as the rank of the ehannel eovarianee matrix 
deereases (i.e., as p inereases). As an example, when SNR=10dB, the optimal training durations are around 
ATs, 12Ts and 20Ts for p = 0.9, 0.5 and 0 (i.e., i.i.d Rayleigh fading ehannel), respeetively. On the other 
hand, less training period is employed when the UE’s ehannel is highly eorrelated whieh eonsequently 

*'We have also applied the analysis of the current paper for ULA and uniform planar array (UFA) channel models, and have found similar 
training-throughput tradeoff characteristics to that of the current paper. More detailed results can be found in the next section. 
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Fig. 2. Normalized MSE versus training duration of the hybrid Fig. 3. Normalized throughput of the hybrid analog-digital 
analog-digital channel estimator for single user case. channel estimation and beamforming for single user case. 

results a higher normalized throughput. These results eonfirm that the training period should be seleeted 
earefully when the BS has less number of RE ehains than that of antennas. 

2) Multiuser: This subsection examines the training-throughput tradeoff for multiuser case. To this end, 
we set A' = 4 To = 32, normali z ed SNR of all users as {10, 20}dB, and three sets of p as = [0, 0, 0, 0], 
P 2 = [0.5, 0.5, 0.3, 0.5] and P 3 = [0.85,0.9,0.75,0.95]. Eig. 4 shows the througput versus normalized 
training for these settings. As we ean see from this figure, the optimal training duration depends on 
the ehannel eoherenee time. Eurthermore, like in the single user ease, the optimal 6 for multiuser ease 
deereases as the eorrelation eoeffieients of all UEs inerease. On the other hand, for a given p, the optimal 
training duration is not neeessarily the same for all SNR regions. Eor instance, when p = P 2 , the optimal 
6 *s are around 18 and 25 for the 20dB and lOdB normalized SNR values, respectively. This observation 
validates the optimal strueture of 9 whieh is provided in Lemma 3 for p = 0. 

B. Simulation results 

This subseetion demonstrates the theoretieal normalized MSE and throughput expressions via Monte 
Carlo simulations. All of the results are obtained by averaging 20000 channel realizations. Eor this 
subsection, we employ N = QA, K = A and Tc = 2SQTs, and the training symbols P are taken from the 
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Fig. 4. Normalized throughput of the hybrid analog-digital 
channel estimation and beamforming for multiuser case. 



Fig. 5. Theoretical and simulated normalized MSE of the 
hybrid analog-digital channel estimator with perfect p. 


discrete Eourier transform (DET) matrix with appropriate size. 

1) Channel Estimation: The ehannel estimation phase is examined both for perfeet and imperfeet 
seenarios. To this end, this subseetion verifies the theoretieal normalized MSE expressions, and validates 
the effeetiveness of the proposed robust ehannel estimator eompared to that of the non-robust one. Eig. 5 
shows the theoretical and simulated normalized MSEs for different 9 when the SNR is set to lOdB and 
20dB, and is known perfeetly and is eomputed from p = [0.4, 0.6, 0.2,0.7]. As can be seen from this 
figure, the simulated normalized MSEs match that of the theoretieal ones for both SNRs. 

In the next simulation, we examine the effeet of eovarianee matrix estimation error on the MSE of 
the robust and non-robust designs. Towards this end, we eompare the MSE achieved by the robust and 
non-robust channel estimators. Eig. 6 shows the achieved normalized MSEs for the estimated values 
Pi = [0.40,0.60,0.20,0.70], Pa = [0.75,0.90,0.70,0.95] and Pg = [0.95,0.96,0.90,0.98] while setting 
Cfc = a/TV. And the true R^^ = -f A^, where ||Afc|| < e^. As ean be seen from this figure, the 
robust design aehieves less normalized MSE eompared to that of the non-robust one, and its improvement 
is higher when p is higher. This is beeause as p is higher, the preseleeted tj in (22) will be eloser to its 
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- Non-robust (p^) 


2 - Perfect (p^) 

- B - Non-robust (p„) 



10 20 30 40 50 60 

Normalized Training duration p) 


Fig. 6 . Comparison of perfect, robust and non-robust hybrid 
analog-digital channel estimators. 



Fig. 7. Theoretical and simulated normalized throughput of 
hybrid analog-digital channel estimator and beamformer. 


optimal value whieh eonsequently helps to improve the performance of the robust ehannel estimator*^. 
As expeeted, the lowest normalized MSE is aehieved when p is known perfeetly. 

2) Data Transmission: In this subsection, the throughput expression given in (42) is verified via 
eomputer simulations. We also demonstrate the optimal training duration and examine the tightness of 
the average upper bound throughput given in (42) with simulations. In this regard, we ehoose p = 
[0.40, 0.60, 0.20,0.70] and eonsider the ease where is known perfeetly. To plot the simulation normal¬ 
ized throughput, first we compute the SNR from Monte Carlo simulation for eaeh channel realizations, 
then we employ the first equality of (42) for the evaluation of the average normalized throughput. 

Eig. 7 shows the theoretieal and simulated normalized throughput versus training when Rfc is known 
perfeetly. This figure demonstrates that the bound derived in (42) is tight sinee the simulated normalized 
throughput is almost the same as that of the upper bound throughput for both the downlink and uplink 
ZE beamforming sehemes*^. On the other hand, the optimal 0 that maximizes the average throughput 
deereases as the SNR inereases whieh is expeeted. Erom Eig. 7, one ean also realize that utilizing the 

*^The non-robust channel estimator is the design that does not take into account in the design. And the perfect channel estimator 
corresponds to the case where tk = 0 . 

*^The cause of this throughput tightness is likely due to the fact that massive MIMO systems become less sensitive to the actual entries 
of the channel matrix which yields £{ 7 } « 7 (i.e., channel hardening [4]). 
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solution of [32] (i.e., 6 = 1) yields a significant reduction in throughput. 

Note that the simulation results of this paper assumes that tj is realized with suffieiently high (theoreti¬ 
cally infinite) resolution phase shifters. When the hybrid arehiteeture employs constant phase phase shifters 
and switehes, eaeh of the elements of tj ean be realized with some aeeuraey only [23]. Speeifieally, as 
the required aeeuraey inereases, the number of fixed phase phase shifters and switehes inerease. When the 
aeeuraey is set to 10“^ (i.e., the absolute differenee between tj^ and its approximated value), we have 
found almost the same MSE and throughput as those of the plots of this seetion. The simulation results 
demonstrating this faet has been omitted for eoneiseness. 

VIE Extra Simulation results for UEA Channels 

Millimeter wave ehannels are expeeted to be speeular and have low rank, and may not neeessarily 
follow the same eovarianee strueture as that of the exponential channel eorrelation models diseussed in 
the above seetion. In this regard, this seetion provides some simulation results on the evaluation of the 
training throughput tradeoff for mmWave ehannels. To this end, we eonsider a UEA having different 
number of seatterers (L^) (all the other settings are the same as that of the above seetion). The AOD of 
eaeh UE is assumed to have arbitrary phases taken from the uniform distribution in [0, 27r]. 

Eig. 8 shows the training versus throughput eurves for different number of seatterers. All the other 
parameter settings are as in the first paragraph of Seetion VII of the revised manuseript. As ean be seen 
from this figure, the optimal training duration inereases as Lk inereases whieh is expeeted. This is beeause, 
the rank of the ehannel inereases as inereases which fits with the theoretieal result provided in the 
above seetion. 


VIII. Conclusions 

This paper eonsiders hybrid analog-digital ehannel estimation and beamforming for multiuser massive 
MIMO systems with limited number of RE ehains. Under these settings, first, we design novel TDD MMSE 
based hybrid analog-digital ehannel estimator by eonsidering both perfeet and imperfeet ehannel eovarianee 
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(a) (b) 




(c) (d) 

Fig. 8. Theoretical and simulated normalized throughput versus training of the hybrid analog-digital channel estimator and beamformer 
with ULA channel model having different number of scatterers (Lfe). (a): Lk = 4, (b): Lk = 12, (c); Lt = 24, (d): Lk = 64. 


matrix cases. Then, we utilize the estimated ehannels to exploit beamforming for data transmission. 
Under the assumption of perfeet ehannel eovarianee matrix, we show that there is a tradeoff between the 
training duration and aehievable throughput when the number of RE chains is limited and hybrid analog- 
digital ehannel estimation and beamforming is applied. Speeifieally, we exploit the faet that the optimal 
training duration that maximizes the overall network throughput depends on the available number of RE 
ehains, ehannel eoherence time and eovarianee matriees of all UEs. We also show that the training time 
optimization problem ean be formulated as a eoneave maximization problem for some system parameter 
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settings where its global optimal solution can be obtained effieiently using existing tools. The analytieal 
expressions are validated by performing extensive numerieal and Monte Carlo simulations. The robustness 
of the proposed robust ehannel estimator is demonstrated using computer simulations. 


Appendix A 


Proof of Lemma 2 


As ean be seen from (28), Ejy} is the same for both MRC and MRT approaehes. Eor this reason, 
this appendix provides the proof of Lemma 1 when the BS has 1 RE ehain and employ downlink data 
transmission. After some mathematical manipulations, the average SNR ean be expressed as 

r / \ 2 H 


E{7} 


E|h"b|'- 


Ea 

i=l 




+ Eti 




(Ta 


(Ta 


E a 

i=l 


9i+^t9^ 


(45) 


where Qi are the eigenvalues of R sorted in deereasing order (i.e., gi > g 2 9 n)- 


A. When h is an Lid Rayleigh fading channel 

Under an i.i.d Rayleigh fading ehannel, we will have gi = 92 = ■ ■ ■ , = gN — 9 and Ejy} beeomes 



Thus, we ean rewrite (32) as 


(/3 o) , I i,\ ^ -f/ \ 

max---log2(Q;a + 0) = j (o;) 

a p 


(46) 


where a = 2 /! and b = a/* 2 , 


+ 1. Erom the seeond order derivative of f{a) we get 


'21 

da^ 


a / 2 3 — a a 


aa + b\/3 [5 aa + b 


(47) 


As a > 0 and 6 > 0, ^ < 0 for 0 < a < /3. Therefore, (46) is strietly eoneave funetion, and the optimal 


a is unique and obtained by applying simple biseetion seareh approaeh [49], [57]. 
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B. Arbitrary h 

In the following, we show that (46) is strietly eoneave for an arbitrary R (i.e., h) when ELi > 

(i.e., at very high SNR). Eor such a setting, Ejy} of (45) ean be expressed as 

^ ct a 

^{l] =^^9i = (48) 

^ i=l i=l 

where ^max = = — with 0 < 5^ < 1. We again assume that £{ 7 } 1. Under such assumption, 

we ean reexpress the optimization problem (32) as 

max {P - a)[\og 2 {'ymax) + log 2 (/(a))] = /(a) (49) 

a 

where /(a) = The first and seeond derivatives of /(a) ean be obtained by applying a finite 

differenee method. Upon doing so, we will have 

=(/) - a)[log 2 (/(a + 1)) - log 2 (/(a))] - [log 2 ( 7 max) + log 2 (/(a))] (50) 

f (cx^ — — — 

^^2 log2[/(a + 2)] - 2{P-a + l) log^ifia + 1)] + (/3 - a + 2) log2[/(a)] 

= {P- a)(log2[/(a + 2)] - log2[/(a + 1)]) - (/3 - a + 2)(log2[/(a + 1)] - log2[/(a)]) 

<{P - a)(log 2 [/(a + 1 )] - log 2 [/(a)]) - {P - a + 2 )(log 2 [/(a + 1 )] - log 2 [/(a)]) 

= - 2(log2[/(a + 1)] - log 2 [/(a)]) < 0 (51) 

where the fourth and last inequalities are beeause of the faet that 5i are arranged in deereasing order and 
/(«) </(« + !) < /(« + 2). As 7^ — /(“) again eoneave funetion and the optimal a ean be 

obtained by equating 77 = 0 as in (33). 
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Appendix B 
Proof of Lemma 3 


Erom (39), one ean obtain 

h = tr{RfcWf U^} = tr{U^R2u(U^RfcU + a^I)} 

4 = tr{RfcWf U^} + a^trjWf RfcWfc} 

= tr{U^RfcU(U^RfcU + a2l)-^U^R^U(U^RfcU + + (T^r{U^R^U^(U^RfcU + 


When hfc are i.i.d zero mean Rayleigh fading, R^ ean be represented by a sealed identity matrix (i.e., 

_ „2 

Rfc = gfcl). Consequently, tk and tk can be expressed as tk = tk = 0 ^2 - By plugging these expressions 
into (39), one ean represent the average SNR of the kih. UE as 


E{7<=} =;3»- 


(52) 


where q is as defined in Lemma 3. By defining fk{0) = {/3 — OK) log 2 (l + Ejyfc}), the optimization 


problem (43) ean be rewritten as 


K 

max^/fc( 0 ). 

i=l 

After some manipulations, the first and seeond derivatives of fk{d) can be expressed as 


dfkje) 

de 
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de^ 


= - A'log2(l + E{7fc}) + 


= -K 


dE{7fc} 


1 + E{7fc} 

-1 
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1 + E{ 7 fc} [ 


+ {f3- 9K) 
du 


f3-9K dE{jk} 

1 + E{7fc} d9 
-K dE{74 
1 + E{7fc} d9 
d'E{74 
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/3-9K d^E{^k} 

1 + E{74 d9^ 


(53) 


(54) 


It follows that 


dE{7fc} _ q d^E{ 7 fc} _ 
d9 ’ d9‘^ 

By substituting and into (54), one ean obtain < 0 in the desired region 9K < (3. 

Henee (53) is strietly eoneave funetion and the optimal 9 ean be obtained using biseetion approaeh from 
Eli, ^ = 0 [49], 
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Next we examine the optimal Q when E{ 7 fc} 3> 1 (at high SNR regions). Eor these regions, the optimal 
d will satisfy 

+ E{7t}) + = 0 ^ - A'log,(E{ 7 i}) + = 0 


k=l 


1 + E{7fc} d9 


- A-logj(9) = K 


e 


1 + log2 ^ 


a 


d / J 


(55) 


Appendix C 

Covariance Matrix Estimation 

As we ean see from Eig. 1, the uplink and downlink data, and uplink pilot transmissions are exeeuted in 
eaeh Tc. This shows that the BS ean exploit the uplink pilot and deeoded data symbols to obtain suffleient 
samples for eomputing the eovarianee matrix of eaeh UE. Erom the uplink pilot, one ean have whieh 
is defined as (35). Eor the given h^, we suggest a simple least squares estimation approaeh to maintain 
the statistieal behavior of as 

hkt = U(U^U)-'hfc = Uhfc. (56) 

The seeond equality is due to tj^tj = I. Also, during the uplink data transmission, we will have 

Yd = U^(HX^ + N) (57) 

where is the analog unitary matrix in the uplink data transmission phase [23] and it may vary from 
one eoherenee time to the other, and X = [xi,X 2 , • ■ ■ ,:s.k] are the transmitted symbols of all UEs with 
Xfc G In praetiee, as eaeh UE transmits its own data symbols independently, x^ are likely to be 

uneorrelated. Eor this reason, the term that eontains ean be obtained from 

kd = 7 ^U(U^U)-'YrfXfc ^ UU^(hfc + Nxfc) (58) 

Xfc I 

where x^ are the deeoded data symbols of the kth UE. Note that when f3 — a is higher, the seeond 
approximation term of this expression is eloser to equality. 



Main manuscript: Accepted in IEEE Transaetions on Communieation 


Erom the uplink pilot and data transmission phases of L frames, one ean eompute the following sample 
eovarianee matrix for the kth UE 

Rfc = + Rfcd) (59) 

^ z ZH _ ^ ~ ~H z z z 

where = Y.i=i ^kit\iv R-fcd = Ei=i ^kidhkid^ and Yiku and \ikid are the obtained from (56) and 
(58) for eaeh duration, respeetively. It is elearly seen that R^. eontains both the ehannel and noise 
information. As the noise is assumed to be white, it has an impaet on the eigenvalues of R^ only. Thus, 
one approaeh of estimating the ehannel eovarianee matrix is 

Rfc = v^Ei+vf + v,r,+vf (60) 

where Tt{Td) and Vt(Vrf) are the eigenvalue and eigenveetor matriees of Rfct(Rfcd), respeetively, = 
IFj - and r,+ = |r, - 

We would like to mention here that the eurrent paper proposes a partieular approaeh to estimate R^. And, 
as sueh an estimate is not neeessarily optimal, better estimation aeeuraey ean be aehieved by eonsidering 
different approaeh (for instanee by ineorporating weights in (59)). Eurthermore, the estimation of from 
Rfc is still an open researeh topie and is left for future researeh. 
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